99 research outputs found
Co-evolution of RDF Datasets
Linking Data initiatives have fostered the publication of large number of RDF
datasets in the Linked Open Data (LOD) cloud, as well as the development of
query processing infrastructures to access these data in a federated fashion.
However, different experimental studies have shown that availability of LOD
datasets cannot be always ensured, being RDF data replication required for
envisioning reliable federated query frameworks. Albeit enhancing data
availability, RDF data replication requires synchronization and conflict
resolution when replicas and source datasets are allowed to change data over
time, i.e., co-evolution management needs to be provided to ensure consistency.
In this paper, we tackle the problem of RDF data co-evolution and devise an
approach for conflict resolution during co-evolution of RDF datasets. Our
proposed approach is property-oriented and allows for exploiting semantics
about RDF properties during co-evolution management. The quality of our
approach is empirically evaluated in different scenarios on the DBpedia-live
dataset. Experimental results suggest that proposed proposed techniques have a
positive impact on the quality of data in source datasets and replicas.Comment: 18 pages, 4 figures, Accepted in ICWE, 201
Measuring Accuracy of Triples in Knowledge Graphs
An increasing amount of large-scale knowledge graphs have been constructed in recent years. Those graphs are often created from text-based extraction, which could be very noisy. So far, cleaning knowledge graphs are often carried out by human experts and thus very inefficient. It is necessary to explore automatic methods for identifying and eliminating erroneous information. In order to achieve this, previous approaches primarily rely on internal information i.e. the knowledge graph itself. In this paper, we introduce an automatic approach, Triples Accuracy Assessment (TAA), for validating RDF triples (source triples) in a knowledge graph by finding consensus of matched triples (among target triples) from other knowledge graphs. TAA uses knowledge graph interlinks to find identical resources and apply different matching methods between the predicates of source triples and target triples. Then based on the matched triples, TAA calculates a confidence score to indicate the correctness of a source triple. In addition, we present an evaluation of our approach using the FactBench dataset for fact validation. Our findings show promising results for distinguishing between correct and wrong triples
Opportunistic linked data querying through approximate membership metadata
Between URI dereferencing and the SPARQL protocol lies a largely unexplored axis of possible interfaces to Linked Data, each with its own combination of trade-offs. One of these interfaces is Triple Pattern Fragments, which allows clients to execute SPARQL queries against low-cost servers, at the cost of higher bandwidth. Increasing a client's efficiency means lowering the number of requests, which can among others be achieved through additional metadata in responses. We noted that typical SPARQL query evaluations against Triple Pattern Fragments require a significant portion of membership subqueries, which check the presence of a specific triple, rather than a variable pattern. This paper studies the impact of providing approximate membership functions, i.e., Bloom filters and Golomb-coded sets, as extra metadata. In addition to reducing HTTP requests, such functions allow to achieve full result recall earlier when temporarily allowing lower precision. Half of the tested queries from a WatDiv benchmark test set could be executed with up to a third fewer HTTP requests with only marginally higher server cost. Query times, however, did not improve, likely due to slower metadata generation and transfer. This indicates that approximate membership functions can partly improve the client-side query process with minimal impact on the server and its interface
sameAs.cc: The Closure of 500M owl: sameAs Statements
The owl:sameAs predicate is an essential ingredient of the Semantic Web architecture. It allows parties to independently mint names, while at the same time ensuring that these parties are able to understand each other’s data. An online resource that collects all owl:sameAs statements on the Linked Open Data Cloud has therefore both practical impact (it helps data users and providers to find different names for the same entity) as well as analytical value (it reveals important aspects of the connectivity of the LOD Cloud). This paper presents sameAs.cc: the largest dataset of identity statements that has been gathered from the LOD Cloud to date. We describe an efficient approach for calculating and storing the full equivalence closure over this dataset. The dataset is published online, as well as a web service from which the data and its equivalence closure can be queried
Multidimensional integration of RDF datasets
Data providers have been uploading RDF datasets on the web to aid researchers and analysts in finding insights. These datasets, made available by different data providers, contain common characteristics that enable their integration. However, since each provider has their own data dictionary, identifying common concepts is not trivial and we require costly and complex entity resolution and transformation rules to perform such integration. In this paper, we propose a novel method, that given a set of independent RDF datasets, provides a multidimensional interpretation of these datasets and integrates them based on a common multidimensional space (if any) identified. To do so, our method first identifies potential dimensional and factual data on the input datasets and performs entity resolution to merge common dimensional and factual concepts. As a result, we generate a common multidimensional space and identify each input dataset as a cuboid of the resulting lattice. With such output, we are able to exploit open data with OLAP operators in a richer fashion than dealing with them separately.This research has been funded by the European Commission through the Erasmus Mundus Joint Doctorate Information Technologies for Business Intelligence-Doctoral College (IT4BI-DC) program.Peer ReviewedPostprint (author's final draft
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
Vocabularies are used for modeling data in Knowledge Graphs (KGs) like the Linked Open Data Cloud and Wikidata. During their lifetime, vocabularies are subject to changes. New terms are coined, while existing terms are modified or deprecated. We first quantify the amount and frequency of changes in vocabularies. Subsequently, we investigate to which extend and when the changes are adopted in the evolution of KGs. We conduct our experiments on three large-scale KGs: the Billion Triples Challenge datasets, the Dynamic Linked Data Observatory dataset, and Wikidata. Our results show that the change frequency of terms is rather low, but can have high impact due to the large amount of distributed graph data on the web. Furthermore, not all coined terms are used and most of the deprecated terms are still used by data publishers. The adoption time of terms coming from different vocabularies ranges from very fast (few days) to very slow (few years). Surprisingly, we could observe some adoptions before the vocabulary changes were published. Understanding the evolution of vocabulary terms is important to avoid wrong assumptions about the modeling status of data published on the web, which may result in difficulties when querying the data from distributed sources
Odorant-Dependent Generation of Nitric Oxide in Mammalian Olfactory Sensory Neurons
The gaseous signalling molecule nitric oxide (NO) is involved in various physiological processes including regulation of blood pressure, immunocytotoxicity and neurotransmission. In the mammalian olfactory bulb (OB), NO plays a role in the formation of olfactory memory evoked by pheromones as well as conventional odorants. While NO generated by the neuronal isoform of NO synthase (nNOS) regulates neurogenesis in the olfactory epithelium, NO has not been implicated in olfactory signal transduction. We now show the expression and function of the endothelial isoform of NO synthase (eNOS) in mature olfactory sensory neurons (OSNs) of adult mice. Using NO-sensitive micro electrodes, we show that stimulation liberates NO from isolated wild-type OSNs, but not from OSNs of eNOS deficient mice. Integrated electrophysiological recordings (electro-olfactograms or EOGs) from the olfactory epithelium of these mice show that NO plays a significant role in modulating adaptation. Evidence for the presence of eNOS in mature mammalian OSNs and its involvement in odorant adaptation implicates NO as an important new element involved in olfactory signal transduction. As a diffusible messenger, NO could also have additional functions related to cross adaptation, regeneration, and maintenance of MOE homeostasis
Suppression of grasshopper sound production by nitric oxide-releasing neurons of the central complex
The central complex of acridid grasshoppers integrates sensory information pertinent to reproduction-related acoustic communication. Activation of nitric oxide (NO)/cyclic GMP-signaling by injection of NO donors into the central complex of restrained Chorthippus biguttulus females suppresses muscarine-stimulated sound production. In contrast, sound production is released by aminoguanidine (AG)-mediated inhibition of nitric oxide synthase (NOS) in the central body, suggesting a basal release of NO that suppresses singing in this situation. Using anti-citrulline immunocytochemistry to detect recent NO production, subtypes of columnar neurons with somata located in the pars intercerebralis and tangential neurons with somata in the ventro-median protocerebrum were distinctly labeled. Their arborizations in the central body upper division overlap with expression patterns for NOS and with the site of injection where NO donors suppress sound production. Systemic application of AG increases the responsiveness of unrestrained females to male calling songs. Identical treatment with the NOS inhibitor that increased male song-stimulated sound production in females induced a marked reduction of citrulline accumulation in central complex columnar and tangential neurons. We conclude that behavioral situations that are unfavorable for sound production (like being restrained) activate NOS-expressing central body neurons to release NO and elevate the behavioral threshold for sound production in female grasshoppers
- …